RAGify Your Skills

>

Hands-on Workshop with

Python, FastAPI, and Azure Services

1

2

Introduction to RAG

Estimated time: 30 minutes

by Kevin Lin, Andreas Mueller

3

Start anything we do by first asking the Why questions. Understand the purpose, goal of what we do, before moving on to How and What. Company perspective: Why is RAG relevant to Zuhlke's AI strategy? Individual perspective: Why should I learn and care about RAG as a software engineer?

4

Retrieval Augmented Generation (RAG)

An NLP approach that combines the strengths of retrieval systems and generative models.

Retrieving relevant information from external knowledge sources and using it to generate more accurate and contextually relevant responses

5

Using an LLM in a Q&A Setup

6

Q&A solution using RAG

7

Strength of RAG

Enhanced Accuracy and Reliability

• Grounded Responses: RAG models retrieve factual information from trusted sources, grounding their outputs in real data and reducing incorrect answers (Lewis et al., 2020). • Reduced Hallucinations: By accessing external knowledge bases, RAG minimizes the generation of fabricated information.

Up-to-Date Information with Improved Contextual

• Dynamic Knowledge Integration: RAG can incorporate the latest information without retraining the entire model, ensuring responses reflect current data (Lewis et al., 2020). • Adaptability: Models can quickly adapt to new domains by updating the retrieval database. • Contextual Relevance: Accessing relevant documents during generation leads to more contextually appropriate responses. • Disambiguation: The retrieval process helps resolve ambiguities by considering a wider context.

Efficiency in Knowledge-Intensive Tasks

• Scalability: RAG handles large data volumes efficiently, suitable for applications like customer support with vast knowledge bases. • Domain Specificity: Tailoring the retrieval corpus allows for industry-specific applications.

8

Challenges with RAG

Quality and Bias of Retrieved Data

• Dependence on Data Sources: Accuracy relies on the quality of external knowledge bases (Lewis et al., 2020). • Bias Amplification: Potential propagation of biases from the retrieval corpus.

Integration and Maintenance

• System Complexity: Integrating retrieval systems adds architectural complexity. • Continuous Updating: Ongoing maintenance is required to keep knowledge bases current.

Privacy and Security Concerns

• Data Leakage: Accessing external data raises concerns about exposing sensitive information. • Compliance Risks: Ensuring adherence to data protection regulations like GDPR.

9

Workshop Goal

> To implement a reference RAG application that retrieves and generates based on documents stored in Azure DevOps.

Full details available on the Wiki: Workshop Documentation

10

Use Case

Test our RAG application with Zühlke insurance documents to evaluate its effectiveness.

Is teeth cleaning covered?
Are shampoos or moisturizers for eczema covered under the policy?
Are drugs purchased without a doctor’s prescription covered under the policy?
Is a referral letter required before seeing a specialist?
What should I do if I need an MRI after visiting A&E?
What ward am I covered for hospitalisation?
What is the coverage limit for specialist consultations?

11

Key Infra Service Required

Azure Cloud supports developing RAG by providing tools for managing LLMs, databases, and integrating them through APIs:

Azure AI Search - Vector Database for retrieval
OpenAI models - LLM for embeddings and generation
Azure Blob Storage - Document storage

The workshop's purpose is not to teach you how to set up Azure services, but to provide a working environment for the workshop.

12

>

Live Demo with Front-end

>

13

How do we start?

High-Level tasks for everyone:

Setup Environment: Setup environment and dependencies for the backend application
Document Ingestion: Load, chunk, and embed documents for the Vector DB
Query and Retrieval: Implement retrieval logic to query the Vector DB using embeddings
Frontend Integration: Run the frontend and integrate it with the backend to test the application

14

How do we start?

Find your partner or team, and work together (Pair or Trio)
Use the Parking Lot for questions that arise or reach out to us
Follow the agenda

15

Agenda

Day 1 (Thu)

Introduction to RAG - 30 mins
Setup Environment (hands-on) - 60 mins

Break - 30 mins

Ingestion (hands-on) - 60~90 mins

Day 2 (Fri)

Retrieval (hands-on) - 60 mins
Put It All Together (hands-on) - 60 mins

Break - 30 mins

Forward and Beyond - 30 mins

16

> The best way to learn something is to do it.

- Aristotle

17